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Abstract 

Let X be randomly chosen from {—1, 1}", and let Y be randomly chosen from the standard 
spherical Gaussian on R" . For any (possibly unbounded) polytope P formed by the intersection 
of k halfspaccs, we prove that 

|Pr [X e P] - Pr [y e P] I < log®/^ k ■ A, 

where A is a parameter that is small for polytopes formed by the intersection of "regular" 
halfspaces (i.e., halfspaces with low influence). The novelty of our invariance principle is the 
polylogarithmic dependence on k. Previously, only bounds that were at least linear in k were 
known. 

We give two important applications of our main result: 

• A bound of log*^'^-* k ■ e^l^ on the Boolean noise sensitivity of intersections of k "regular" 
halfspaces (previous work gave bounds linear in fc). This gives a corresponding agnostic 
learning algorithm for intersections of regular halfspaces. 

• A pseudorandom generator (PRG) with seed length 0(lognpoly(logA:, 1/(5)) that (5-fools 
all polytopes with k faces with respect to the Gaussian distribution. 

We also obtain PRGs with similar parameters that fool polytopes formed by intersection of 
regular halfspaces over the hypercube. Using our PRG constructions, we obtain the first deter- 
ministic quasi-polynomial time algorithms for approximately counting the number of solutions 
to a broad class of integer programs, including dense covering problems and contingency tables. 



1 Introduction 



An important theme in theoretical computer science over the last two decades has been the use- 
fulness of translating a combinatorial problem over a discrete domain (e.g., { — 1, 1}") to a problem 
in continuous space. The notion of convex relaxation, for example, is now a standard technique in 
the design of algorithms for optimization problems. More recently, the study of analytic properties 
of Boolean functions (e.g., Fourier spectra and sensitivity) has been a fundamental tool for proving 
results in hardness of approximation [Wol08, O'D08] and learning theory [Man94]. 

The influential work of Mossel, O'Donnell, and Oleszkiewicz [MOO05] proving the "Majority 
Is Stablest" conjecture has led to a rich collection of hardness results for constraint satisfaction 
problems, most notably for the Max-Cut problem. The crux of their work is an invariance principle 
relating the behavior of low-degree polynomials over the uniform measure on {—1,1}" to their 
behavior with respect to Gaussians: 

Theorem 1.1 (invariance principle [MOO05]). Let P be a multilinear polynomial such that \\P\\ = 
1. Then, for any t G M, 



Pr [Pix)>t]- Pr [P(x)>t] 



< T. 



Here A/"" is the standard multivariate spherical Gaussian distribution on R"; the parameter r 
depends on the coefficients of P and is small if P is "regular" in the sense that the "influence" of 
each variable in P is small. 

Roughly speaking, the above invariance principle says that the cumulative distribution function 
(cdf) of a polynomial over {—1, l}" is close to the cdf of a polynomial over AA" if the coefficients 
of the polynomial are sufficiently regular. Although developed in the context of hardness of ap- 
proximation, their invariance principle has also found applications in the theory of social choice 
[O'D08] and more recently to the construction of pseudorandom generators for polynomial threshold 
functions [MZ09]. 

The main result of this paper is an invariance principle for characteristic functions of polytopes. 
Recall that a polytope /C is a (possibly unbounded) convex set in M" formed by the intersection 
of some finite number of supporting halfspaces. We refer to /C as a /c-polytope if it is equal to the 
intersection of k halfspaces. Our main theorem is as follows: 

Theorem 1.2 (invariance principle for polytopes; see Theorem 3.1 for exact statement). Let IC be 
a k-polytope. Then, 



Pr [xGiC]- Pr [x £ fC] 

xG„{-l,l}" x^Af"' 



< log^/^ k ■ A. 



The parameter A depends on the coefficients of the bounding hyperplanes of IC and is small if 
these coefficients are sufficiently regular. In particular, if /C equals {x \ W^x < 9} for W G ^"x'^ 
and 9 G R'^, and each column u of 1^ is e-regular, i.e., satisfies J27=i ''^t — ^^ll^lli' then A is less 
than e^/®. Note that there is no restriction on the vector 9. Our invariance principle also holds 
more generally for any product distribution that is hypercontractive and whose first four moments 
are appropriately bounded. 

The novelty of our theorem is the dependence of the error on k. Applying a recent result due to 
Mossel [MosOS], it is possible to obtain a statement similar to Theorem 1.2 with an error term that 
has a polynomial dependence on k. Achieving polylogarithmic dependence on k, however, is much 
harder, and we need to use some nontrivial results from the analysis of convex sets in Gaussian 
space. 
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The case k = 1, a single halfspace, is equivalent to the classical Berry-Esseen theorem [Fel68], 
a fundamental theorem from probability and statistics giving a quantitative version of the Central 
Limit Theorem. We can therefore view our principle as a generalization of the Berry-Esseen theorem 
for polytopes. Further, understanding the structure of integer points in polytopes (that is, solutions 
to integer programs) is an important topic in computer science [BV08], optimization [Zie95], and 
combinatorics [BR07], and we believe our invariance principle will find many applications. 

In this paper, we use our invariance principle to derive new results in learning theory and 
derandomization. 

1.1 Results on Noise Sensitivity of Intersections of Halfspaces 

Noise sensitivity of Boolean functions introduced in the seminal works of [KKL88], [BKS99] is an 
important notion in the analysis of Boolean functions. Roughly speaking, the noise sensitivity of a 
Boolean function / measures the probability over a randomly chosen input x that / changes sign 
if each bit of x is flipped independently with probability 5. 

Bounds on the noise sensitivity of Boolean functions have direct applications in hardness of 
approximation [HasOl, KKMO07], hardness amplification [O'D04], circuit complexity [LMN93], the 
theory of social choice [Kal05], and quantum complexity [ShiOO]. Here, we focus on applications 
in learning theory, where it is known that bounds on the noise sensitivity of a class of Boolean 
functions yield learning algorithms that succeed in harsh noise models such as the agnostic model 
of learning [KKMS08]. 

A direct application of our invariance principle Theorem 1.2 gives the following new bound on 
the noise sensitivity of intersections of regular halfspaces: 

Theorem 1.3 (learning application: noise sensitivity of intersections of halfspaces). Let f be 

computed by the intersection ofk, e-regular halfspaces. Then the Boolean noise sensitivity of f for 
noise rate e is at most (log/c)*^^^) • e^/^. 

Applying a result of Kalai et al. [KKMS08] and Klivans et al. [KOS04], the above theorem 
implies that intersections of k, e-regular halfspaces are agnostically learnable with respect to the 
uniform distribution on { — 1, 1}" in time n^'"^*^' ' for any constant error parameter. In particular, 
intersections of { — 1, 1} halfspaces (oriented majorities) are e-regular and fall into this class. The 
previous best algorithm for learning this concept class, even in the easier PAC model, ran in time 
„o(fc2) ([KOS04, KKMS08]). 

The current best bound for the noise sensitivity of intersection of k arbitrary halfspaces is 
0{ky/e). This bound is obtained by starting with the y/e noise sensitivity bound for a single 
halfspace due to Peres [Per04] and applying a union bound over k halfspaces. On the other hand, 
optimal bounds of @{y/logk^/e) for the related Gaussian noise sensitivity were obtained recently by 
Klivans et al. [KOS08]. We believe that the right order for Boolean noise sensitivity of intersection 
of k halfspaces is Q{^/^ogk^/e) as well. 

Improving the bounds for Boolean noise sensitivity would be of considerable interest, particu- 
larly for the learning theory applications, as learning the class of intersections of halfspaces even 
with respect to specific distributions such as the uniform distribution over {1, —1}" is an important 
open problem in learning theory. We feel that our result is an important step towards improving 
noise sensitivity bounds for intersections of arbitrary (not necessarily regular) halfspaces. 

1.2 Results on Pseudorandom Generators for Polytopes 

Our invariance principle also yields new results for several problems in derandomization. Recall 
the following definition of pseudorandom generators (PRCs): 
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Definition 1.4. Let fi be a distribution overM. A function G : {0, 1}'' — > {1, —1}" is said to 6-fool 
a polytope fC with respect to if the following holds. 



Pr [G{y) G /C] - Pr [X e /C] 



< S. 



Combining our invariance principle with a PRG similar to a recent construction of Meka and 
Zuckerman [MZ09], we obtain a black-box algorithm for approximately counting the number of 
{ — 1, 1}" points in polytopes formed by the intersection of regular halfspaces: 

Theorem 1.5 (PRCs for regular polytopes and approximate counting). For all 6 G (0, 1), there 
exists an explicit PRG G : {0, 1}'' {1, —1}"' with r = 0{{lognlogk)/e) that 6-fools all polytopes 
formed by the intersection of k e-regular halfspaces with respect to all proper and hypercontractive 
distributions /i for e = (5^/(log^'^ k){\og{\/6)). 

The constants above depend on the hypercontractivity constants of /U. We define proper and 
hypercontractive distributions in the next section and remark that the uniform distribution over 
{— 1, 1}*^ is an example of such a distribution. 

Theorem 1.5 implies quasi-polynomial time, deterministic, approximate counting algorithms 
for a broad class of integer programs. For example, dense covering programs such as dense set- 
cover, and {0, l}-contingency tables correspond to polytopes formed by the intersection of e-regular 
halfspaces. For these types of integer programs, we can deterministically approximate, to within 
an additive error e, the number of integer solutions in quasi-polynomial time. 

As stated, our invariance principle applies to polytopes whose bounding hyperplanes have coef- 
ficients that are sufficiently regular. In some cases, however, we can randomly rotate an arbitrary 
polytope so that all the bounding hyperplanes become regular. As such, after applying a suitable 
random transformation (which we derandomize) , we can build PRCs for arbitrary polytopes if the 
underlying distribution is spherically symmetric (e.g., Gaussian): 

Theorem 1.6 (PRCs for Gaussian space). For a universal constant c > and all 5 > clog^ k/n^/^^ , 
there exists an explicit PRG G^f : {0,1}'' M" with r = 0{{logn){log^'^ k)/6^-^) that 6-fools all 
k-polytopes with respect to N . 

Additionally, we prove an invariance principle for polytopes with respect to the uniform distri- 
bution over the n-dimensional sphere S""^. This allows us to easily modify our PRG for polytopes 
in Gaussian space and build PRGs for intersections of spherical caps: 

Theorem 1.7 (PRGs for intersections of spherical caps). For a universal constant c > and all 5 > 
clog^ there exists an explicit PRG G^p : {0, 1}'~ S'"-^ with r = 0((log n)(log^-^ k)/5^-'^) 

that 6-fools all k-polytopes with respect to the uniform distribution over S^~^. 

An immediate consequence of the above PRG construction is a polynomial time derandomiza- 
tion of the Goemans- Williamson approximation algorithm for Max-Cut [GW95] and other similar 
hyperplane based randomized rounding schemes. Observe that this derandomization is a black-box 
derandomization as opposed to some earlier derandomizations of the Goemans- Williamson algo- 
rithm, which are instance-specific (e.g., [MH99]). 

1.3 Proof Outline 

In this section, we give a high level outline of the proof of our invariance principle and contrast it 
with the works of Mossel et al. [MOO05] and Mossel [Mos08]. The proof proceeds in two steps. 
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Step One: As in Mossel et al. [MOO05] and Mossel [Mos08], we first use the Lindeberg method 
(see [PR89]) to prove an invariance principle for smooth functions. By this we mean proving that 



X6{-i,i}" yeA/^'" 



<7, (1-1) 



where ii, ■ ■ ■ ,ik are hnear functions (corresponding to the normals of the faces of the A;-polytope) 
and ^' is a smoothing function. The value 7 will depend on k, the coefficients of the £p's and the 
derivatives of ^. The function ^ is often called a "test" function and is smooth if there is a uniform 
bound on its fourth derivative. Notice here that ^ maps M'^ to M; in [MOO05], they were concerned 
with the value ^{Q{X)) for a low-degree polynomial Q and a univariate test function ^. 

At this point, we could use the A;-wise product of a test function constructed by Mossel et al. to 
approximate the logical AND function. Further, Mossel provides a very general framework for 
multivariate test functions and gives bounds for the overall error incurred by the hybrid argument. 
Here we run into our first difficulty: the standard hybrid argument as used by Mossel et al. and 
Mossel results in a bad dependence on the coefficients of the £p's. In particular, the resulting error 
term is not small even for polytopes formed by the intersection of regular halfspaces. 

To solve this problem, we use a non-standard hybrid argument that groups the input variables 
into blocks. We observe that it is irrelevant in which order we replace Xj's with 1^'s - in fact a 
random order would suffice. Further, we can group the Xj's into blocks and proceed blockwise with 
the hybrid argument. To implement this intuition, we partition [n] randomly into a set of blocks 
and replace all the Xj's within a block by the corresponding l^'s one block at a time. Proceeding 
in this fashion with a random partitioning has a "smoothing effect" on the coefficients of the linear 
functions resulting in a much better bound on the error in terms of the coefficients. 

Roughly speaking, ipi denotes the i'th coefficient of ip, then the standard hybrid arguments of 
[PR89], [MOO05], [MosOB] incur an error proportional to X]iG[n] (™^^pG[fc] which can be as 

large as Q{k) even for regular functions ip. In contrast, our randomized-blockwise-hybrid argument 
only suffers an error of (log A;) • maXp^j^j |^pi|^, which is small for regular functions. It turns out 
that in the above analysis, we can choose the random partitioning into blocks in a 0(log A;)-wise 
independent manner, instead of uniformly at random, and this is crucial for our PRG constructions. 



Step Two: Given the above invariance principle for smooth functions, we now aim to translate 
the closeness in expectation for smooth functions to closeness in cdf distance. Here the smoothness 
of the test function ^ becomes important, and we run into our second problem: the natural choice 
of test function ^ (the multivariate version of the test function from Mossel et al.) leads to an error 
bound on the order of k, rather than poly (log k). To get around this problem, we first observe that in 
Mossel's proof of the multivariate invariance principle as in our randomized-hlockwise-hyhrid argu- 
ment, it suffices to bound the 7i-norm' of the fourth derivative sup2,giRfe(^p ^ ^ ^gj^j {dpOgdrds"^ {x)\) , 
instead of uniformly bounding the fourth derivative sup^^-g^k p g j. ^.^^j.^ddpdgdrds'^ {x)\) . Thus, it suf- 
fices to obtain a smooth approximation of the AND function for which the former quantity is small. 
Fortunately for us, we uncovered a beautiful result due to Bentkus [Ben90], who constructs a smooth 
approximation of the AND function with precisely this property. 

The final difficulty for translating closeness in expectation as in Equation 1.1 to closeness in cdf 
distance is to prove that ^ differs from the characteristic function only on a set of small Gaussian 
measure. To this end, we show that it suffices to bound the Gaussian measure of /oo-neighborhoods 
around the boundary of fc-polytopes. For an /oo-neighborhood of width A, a union bound would 
imply Gaussian measure on the order of kX. At this point, however, we can apply a nontrivial 
result due to Nazarov [Naz03] on the Gaussian surface area of /s-polytopes to get the much better 
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bound of \/log k A. This result of Nazarov was used before by Klivans et al. [KOS08] in the context 
of learning intersections of halfspaces with respect to Gaussian distributions. 



We give an outline of the proofs of the applications of the invariance principle to noise sensitivity 
and PRGs in the corresponding sections. 

1.4 Related Work 

As mentioned earlier, the classical Berry-Esseen theorem [Fel71] from probability, a quantitative 
version of the Central Limit Theorem, gives an invariance principle for the case of a single halfspace 
(i.e., k = 1). More precisely, for any w € M", such that \\w\\ = 1 and each coefficient of w is at 
most e, the Berry-Esseen theorem states that 



Bentkus [Ben03] proves a multidimensional Berry-Esseen theorem for sums of vector-valued 
random variables each with identity covariance matrix, whose error term depends on the Gaussian 
surface area of the test set. Although his paper deals with topics related to our work, his result 
seems to have no implications in our setting. 

There is a long history of research on approximately counting the number of solutions to integer 
programs, especially with regard to contingency tables [JS97, CD03]. However, not much is known 
in terms of deterministic algorithms, and we believe that our deterministic quasi-polynomial time 
algorithms for dense covering problems and dense set cover instances is the first result of its kind. 

Regarding contingency tables. Dyer [Dye03] gave a deterministic, relative-error approximation 
for counting solutions to contingency tables that runs in time exponential in the number of rows. 
In contrast, we obtain an algorithm that runs in quasi-polynomial time in the number of rows 
(however, we do not give a relative-error approximation). Although not stated explicitly before, 
it is easy to see that the pseudorandom generator for small space machines of Impagliazzo et 
al. [INW94] yields a deterministic algorithm for counting n x k contingency tables with additive 
error at most e and run time 2'^(^°g i^k/s))^ rj.^^^ incomparable to our algorithm for contingency 
tables which has run time 2(^°^") P°^y('°s'^'^/'^). In our case, we obtain a polynomial-time, black-box 
derandomization for contingency tables with a constant number of rows (for e = 0(1)). 

For PRGs for intersections of halfspaces, recently Gopalan et al. [GOWZ09] and Diakonikolas 
et al. [DKN09] gave results incomparable to ours. Gopalan et al. give generators for arbitrary 
intersections of k halfspaces with seed length linear in k but logarithmic in 1/5. Diakonikolas 
et al. show that bounded independence fools intersections of quadratic threshold functions and 
in particular, get generators with seed length 0((logn) • poly(A:, 1/e)) fooling intersections of k 
halfspaces. Due to the at least linear dependence on k, the results of the above works do not yield 
good algorithms for counting solutions to integer programs, as in this setting k is typically large 
(e.g., poly(n)). 

2 Notation and Preliminaries 

We use the following notation. 

1. For W G M"^^, e G R*-', ]C{W,9) denotes the polytope IC{W,9) = {x : W^x < 9}. We say a 
polytope IC(W,9) as above has k faces. 



Pr \(w,x) > t] 
x-e{-i,i}" 



Pr [{w,x) >t] < 0(e). 
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2. Unless stated otherwise, we work with the same polytope IC{W,9) and assume that the 
columns of the matrix W have norm one. We often shorten IC(W, 6) to /C iiW,6 are clear 
from context. We assume that k > 2. 

3. For A G ]R»"iX"^2^ j\T dg^otes the transpose of A and for p G [m2\, AiP denotes the p'th column 
of A. 

4. The all ones vector in M'^ is denoted by 1/;. 

5. For u G M'^, define rectangle Rect(ti) = (— oo,ni] x (— c>o,U2] x ••• x (— c>o,iifc]. Note that 
X e IC{W, 6) if and only if W"^x G Rect(^). 

6. AA" (where N = AA(0, 1)) denotes the standard multivariate spherical Gaussian distribution 
over M" with mean and identity covariance matrix. 

7. For a smooth function ijj : M^' M, let ||?/''-^^ Ill = sup { Yip q r s&[k] I dpdqdrdg V'(oi) • • • > o-k) \ ■ 
(ai,...,afc) gM'^}. 

8. We denote all universal constants by c, C, even when we have in mind different constants in 
the same equation. 

Definition 2.1 (regularity). A vector u G M"' is e-regular ifYi'^t — ^^ll'^^lP- ^ matrix W G M"^'^ 
is e-regular if every column of W is e-regular. A polytope /C = fC{W, 6) is e-regular if W is e- 
regular 

The main results of this paper are applicable to a large class of product distributions that satisfy 
the following two properties. 

Definition 2.2 (proper distributions). A distribution /i over M is proper if for X <— /x, E[X] = 0, 
E[X2] = 1 and ¥.[X^\ = 0. 

Definition 2.3 (hypercontractive distributions). A distribution fi over M is hypercontractive, if 
there exists a constant c such that the following holds. For any m, vector u G W^, and any q > 2, 

/ Nl/g / Nl/2 

(^^E^jK.,x)nj <cv-q[^E^j\{u,x)n'i . 

Two important examples of distributions that are proper and hypercontractive are the uniform 
distribution over the hypercube {1, —1}"' and the multivariate spherical Gaussian AA". 

We also use the following hypercontractivity inequality for degree d multilinear polynomials 
over the hypercube (see [O'D08]). 

Lemma 2.4 ((2, g)-hypercontractivity). For any q >2 and any degree d multilinear polynomial 
P : {1,-1}" ^ M, 

/ Nl/q / xl/2 

E [\P{x)\'i\] <q'l^ E [|P(x)n . 

^ "Regular polytopes" have a different meaning in combinatorics, but for the purpose of this paper, we will abuse 
notation and say a polytope is e-regular if it is formed by the intersection of e-regular halfspaces as in Definition 2.1. 
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3 Invariance Principle for Polytopes 

Our main invariance principle for polytopes IC{W,t) is as follows: 

Theorem 3.1 (invariance principle for polytopes). For any proper and hypercontractive distribution 
II over M there exists a constant C such that the following holds. For any e-regular k-polytope K,, 



Pr [X G /CI - Pr [Y G /C] 



< C (logS/5 k) (e \og{l/e)f'^. (3.1) 



The proof of the theorem can be divided into three parts. 

1. We establish an invariance principle for smooth functions on polytopes (Theorem 3.2) using 
an extension of Lindeberg's method; Section 4 is devoted to proving this part. 

2. We prove that for random variables A,B over M'^, closeness with respect to smooth func- 
tions and anti-concentration bounds for one of the variables imply closeness with respect to 
rectangles (Lemma 3.3). To do so, we use a nontrivial result of Bentkus [Ben90] on smooth 
approximations for the loo norm. 

3. We use a result of Nazarov [Naz03] on Gaussian surface area of polytopes to bound the 
Gaussian measure of "Zoo-neighborhoods" of polytopes in (Lemma 3.4). 

We begin by stating an invariance principle for smooth functions -0 : M'^ — > M. The proof is 
involved, making use of the randomized-blockwise-hybrid argument alluded to in the introduction. 
For clarity we present the proof in the next section (Section 4). 

Theorem 3.2 (invariance principle for smooth functions). For any proper and hypercontractive 
distribution fi over M there exists a constant C such that the following holds. For any e-regular W 
and smooth function ^ : M'^ — > R, 



E [i>(W'^X)] - E ['^(W'^Y)] 



<C||V(")||i(log3fc) {e log(l/e)). 



The following lemma shows that for two random variables A, i? over M'^, closeness with respect 
to smooth functions and anti- concentration bounds for the variable B imply closeness with respect 
to rectangles. Note that to use the lemma we do not need anti-concentration bounds for the random 
variable A. 

Lemma 3.3 (smooth approximation of AND). Let A, B be two random variables overM.'' satisfying 
the following conditions: 

• For all smooth functions i/; -.R^ -^R, \ E[tl;{A)] -E[^p{B)]\ < 

• For a function gi^ : [0, 1] — > [0, 1] the following holds: 

for all A G [0, 1], supg^^k {Pr[B G Rect((9 + Al^) \ Rect(6')]) < ^^(A). 

Then, for all 9 £ R'' , < X < 1, \ Pr[A G Rect(6')] - Pr[B G Rect(6l)] | < CAlog^ k/X'^ + Cgk{X). 

Finally, we use the following anti-concentration bound that follows from Nazarov's estimate on 
the Gaussian surface area of polytopes [Naz03] : 
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Lemma 3.4 (anti-concentration bound for Zoo-neighborhood of rectangles). For < A < 1, 



Pr^^ [W'^x G Rect(^) \ Rect(e - Al^)] = 0(A Vlogl). 

We first prove Theorem 3.1 using the above three results and then prove Lemmas 3.3 and 3.4 
in Sections 3.1 and 3.2. Theorem 3.2 is then proved in Section 4. 

Proof of Theorem 3.1. Let X ^ /u", Y ^ AA" and let random variables A = W'^X, B = W'^Y . 
Then, by Lemma 3.4 and Theorem 3.2, 

Pr[B e + Alfc) \R(0)] < CVbgfcA and \&[^{A)] -E[V'(S)]| < C (log^ A;) e log(l/e) HV^^^^Hi, 

where : IR'^ ^ is any smooth function, G R'^ and A G (0, 1). Therefore, by Lemma 3.3, for 

e G ]R^ 



|Pr[^ G Rect(0)] - Pr[S G Rect(0)]| < C (logH-) log(l/e)e/A^ + CTk^ A. 
The theorem now follows by setting A = (log"'^^''^'^ k) (elog(l/e))-'^/^. □ 

3.1 Smooth approximation of AND 

We now prove Lemma 3.3. For this, we use the following nontrivial result of Bentkus [Ben90] on 
smooth approximations for the Zoo norm. 

Theorem 3.5 (Bentkus [Ben90]). For every a > and < A < 1, there exists a function ip = 
V'a,A : M'' ^ M such that HV^^^^Hi < Clog^ k/X'^ and 



Ha) 



1 if \\a\\oo < a 

if \\a\\oo > a + A . 

^ G [0, 1] otherwise 



Corollary 3.6. For all u € R'' , < X < I, T > \\ 

^11 oo; there exists a function ijj = ipu,x,T 

such that II V'^^^ 111 < Clog^k/X"^ and 



1 if \/le [k],-T + ui <ai<ui 

if ^l & [k],ai > ui + X 

_ G [0, 1] otherwise 



Proof. Let V't/2,a be the function from Theorem 3.5 with a = T/2. Define ip = ipu,x,T : M'^ — > M by 

^u,x,T{ai,.. ■,ak) = ^T/2,A(ai + T/2 - ni,a2 + T/2 -U2,...,ak + T/2 - Uk). 
It is easy to check that ij: satisfies the conditions of the theorem. □ 

Proof of Lemma 5.5. Fix 61 G M'', < A < 1. Choose T G M large enough so that T > H^Hooi 
Pr[||A||oo > T] < A and Pr[||S||oo > T] < A. Let V : ^ 1^ be the function obtained from 
applying Corollary 3.6 to 0, A, T. Then, 

|Pr[^ G Rect(0)] - Pr[^ G RectT(e)]| < A, |Pr[5 G Rect(^)] - Pr[5 G RectT(e)]| < A, (3.2) 
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where RectT(6') = [-T + 61,61] x [-T + 02, 6'2] x • • • x [-T + 6k, 6k] C M*^. Observe that from the 
definition of ip in Corohary 3.6 and Equation 3.2 

Pr [A G Rect(0)] < E[i;{A)] + A < E[tp{B)] + A||^(^) ||i + A. 

Similarly, 

E[^p{B)] < Pr[B G Rect(6' + Al^)] = Pr[B £ Rect{6)] + Pr[B G Rect(6' + Al^) \ Rect(6')] < 

Pr[B eRect{e)]+gk{X), 

where the last inequality follows from the definition of gk- Combining the above two equations we 
get 

Pr[A G Rect(0)] < Pr[B G Rect(0)] + 2A||^(4) ||i + ^^(A) < Pr[B G Rect(0)] + ^^j^^' ^ + gfc(A). 

Proceeding similarly for the function ^/;/, : M'^ — > M obtained by applying Corollary 3.6 to 
t - Alfc, A,r, we get 

Pr [A G Rect(0)] > Pr G Rect(0)] - gk{X). 



Therefore, 



\Pr[A G Rect(0)] - Pr [i3 G Rect(0)]| < +gfc(A). 



□ 



3.2 Anti-concentration bound for /00-neighborhood of rectangles 

Lemma 3.4 follows straightforwardly from the following result of Nazarov [Naz03]. For a convex 
body K C with boundary dK, let T(K) denote the Gaussian surface area of K defined by 



T{K)= / e^da{y), 

Jv&dK 



where da{y) denotes the surface element at y G dK. 

Theorem 3.7 (Nazarov (see [KOS08, Theorem 20])). For a polytope fC with at most k faces, 
r(/C) < CVhgk. 

Proof of Lemma 3.4- Consider an increasing (under set inclusion) family of polytopes ICp for < 
p < A such that ICq = {x : W'^x G Rect(6l - Al^)} and JCx = {x : W'^x G Rect(6')}. Then, 



Pr ^ [W^x G Rect(0) \ Rect(e - Al^)] = / T{]Cp)dp < Cy^logk A, 



fA 
lp=0 

where the last inequality follows from Theorem 3.7. □ 
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4 Invariance principle for Smooth Functions over Polytopes 



We now prove Theorem 3.2. The proof of the theorem is based on Lindeberg's method. Let t = 1/e 
and let TC = {h : [n] ^ [t]} be a family of (2 log fc)- wise independent functions. That is, for all 
/ C [n],|/| < 2 log A: and b G [^]^ Pr^eM^i G h{i) =bi] = ^. 

We remark that to prove Theorem 3.2 we could take the hash family to be the set of all functions. 
However, we work with a (2 log /c)-wise independent family as the analysis is no more complicated 
and we need to work with such hash families while constructing pseudorandom generators. For 
S C [n], let Ws be the matrix formed by the rows of W with indices in S. Define 



1=1 



p=i 



1 4 log fc 



1 / log k 



Theorem 3.2 follows immediately from the following two lemmas. 
Lemma 4.1. For e -regular W , Tl{W) < C logk{e log(l/e)). 
Lemma 4.2. For any smooth function ip ^ M, 



E \i;(W^X)] - E \iIj(W^Y)] 



<A{\og^k)n{W)U^^^\\i. 



Proof of Lemma 4-1- Fix a I G [t], p G [A;]. For i £ [n], let Xj be the indicator random variable 
that is 1 if h{i) = I and otherwise. Then, Pr[Xi = 1] = 1/t and the variables Xi, . . . , Xn are 
(2 log /c)- wise independent. Further, 



4 



\^\h-^il)\\ 



i=l 



Let Yi be i.i.d indicator random variables with Pr[Yi = 1] = 1/t and let Zp = "^^^i Wf^i. Observe 
that Zp and Zp have identical d'th moments for d < 2 log A;. Moreover, by Hoeffding's inequality 
applied to Zp, for any 7 > 0, 



Pr 



>7 



< 2exp 



27^ 



< 2 exp 



=2exp(-2tV)- 



The above tail bound for Zp implies strong bounds on the moments of Zp by standard arguments. 
Setting 7 = -v/21og A; logt/t in the above equation, we get 



Pr 



\Zp\ > 



a/3 log k logt 



< 



1 



-j-2 log k ' 



Therefore, from the above equation and the fact that Zp < 1 



< 



^2 log k 

(41ogA: logt)'°s'= 

f-2 log k ■ 



\Zp\ > 



a/3 log k logt 
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Therefore, 



E 

hGuH 



IK- 



\h-m)\ 



1 4 log fc 



E 



(4 



/ N 2 log fe 



E 



^2 log k 



< 



(41ogfclogt)^°s'= 

f2 log k ■ 



Therefore, from the definition of Ti.(W) and the above equation, 

1 / log fc 



i=l \ p=l 



1 4 log k 



<t^^^^ = 4(log^)(.log(l/e)). 



□ 



The proof of Lemma 4.2 uses a blockwise hybrid argument and careful appUcations of hyper- 
contractivity as sketched in the proof outhne in the introduction. 

We use the following form of the standard Taylor series expansion. For a smooth function 
ip : M.^ ^ R, X e M'^ and pi,...,pr G [k], let dpj^^,,,^p^'ip{x) = dp-^dp^ ■ ■ ■ dp^ il){x). For indices 
pi,...,Pr G [k], let {pi,...,pr)l = si!s2! . . . Sfc!, where, for I G [A;], si denotes the number of 
occurrences oi I in (pi, . . . ,pr). 



Fact 4.3 (Multivariate Taylor's Theorem). For any smooth function - 
i/jix+y) =tp{x)+y2 dpilj{x)yp+ y2 —^dp^qi){x)ypyq+ V 7 — 



R, and x,y e R'', 
dp,q,r'^P{x) ypyqyr+en{j 



where |err(2;,y)| < HV''^^^ ||i • maxpg[fc] \yp\- 



Proof of Lemma Let X <— /i" and Y <— AA". We first partition [n] into blocks using a random 
hash function h Gu'H and then use a blockwise- hybrid argument. Fix a hash function h G TC. View 
X as X^, . . . ,X^, where each X^ = Xf^^i^i^ is chosen independently and uniformly from fi^^ ^'^L 
Similarly, view Y as y^, . . . ,y* where each = y/j-i^^^ is chosen independently and uniformly 

from prove the claim via a hybrid argument where we replace the blocks X^, . . . , X* 

with y^, . . . , y* one at a time. 



For < i < t, let be the distribution with Z\ 



X^ for i < J < t and 

1 < j < ^- Then, is distributed as /i" and is distributed as AA". For / G [t], let 



\h-Hj) 



Hi) 



yj for 



l/logfc 



h{W, I) 



HOI 



.p=i 



Claim 4.4. For 1 < / < and fixed h e H, 



xy 



3_ 

xy 



ijj{W^Z^~^) 



< 



C log^k\\iP^^'^\\ih{WJ). 



Proof. Without loss of generality, suppose that h~^{l) = {!,..., m}. Note that Z', Z''~^ have the 
same random variables in positions m + 1, . . . , n. Let Z'-^^ = {Xi, . . . , Xm, Zm+i, ■ ■ ■ , Z^) and 

= (yi, . . . , y^, Zm+i, • • • , Zn) where (Xi, . . . , Xm) is uniform over /i™ and (yi, . . . , ym) is uniform 
over AA™. Note that (Zm+i, ■ ■ ■ , Zn) is independent of (Xi, . . . , Xm), (yi, • • • , ym)- 

Let Wi G M™^*^ be the matrix formed by the first m rows of W and similarly let W2 G 
be the matrix formed by the last n — m rows of W. Lastly, let V = W^{Zm+i, ■ ■ ■ , Zn) and U be 
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one of X = {Xi, . . . , or y = (Yi, . . . , 1^). Now, by using a Taylor expansion of i/' at y as in 
Fact 4.3, 



i^iW^iUu ■■■,U^, Zm.+U ■■■,Zn))= ^{ W[U + V) 

+ E 



1 



{p,q,r) 



■ dp,g,rm) {Wf, U) {Wl U) {Wl U) + err(y, W^U). (4.1) 



p,g,rG[fe] 

Now, using the fact that ||2;||oo < ||-z||iogfc for z G 



1 / log fc 



\&vr{V,W^U)\ < llV'^^li •max|(W^f,C/)|^ < 

p€[k] 



1 

p=i 



41ogfe 



(4.2) 



E 

X 



Now, by hypercontractivity of fi, 

1/ log k 



< I E 

X 



p=i 



j2^[\{w!,x)\ 



1 / log A: 



1/logfc 



(by power-mean inequality) 



4 log A: 



k 



1/logfc 



< J](ClogA:)2i°g'=||H^ff i°g^ 
\p=i 

< C(log2 fc)/i(W,0. 



(by hypercontractivity of /i) 
(4.3) 



Similarly, by hypercontractivity of M, 



E 

Y 



1/log fc' 



,p=i 



< C{log'^k)h{W,l). 



(4.4) 



Since is proper, for any u^,u'^,u^ G M™, 

E[(ni, X)] = E[(^xi, y)], E[{u\X) {u\X)] = E[{u\Y) {u\Y)] 

E[{u\X) {u\ X) {u\X)] = n{u\Y) {u\Y) {u\ Y)]. 

From the above equations. Equations (4.1), (4.2), (4.3), (4.4) and the fact that X,Y,V are in- 
dependent of one another, it follows that |E [V'(VF^Z') - ^{W^ Z^-^)] \ < C{\og^ k) HV'^^) \\ih{W, I). 

□ 

Lemma 4.2 now follows from the above claim, summing from / = 1, . . . , and taking expectation 
with respect to h £uT~(-- D 
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5 Noise Sensitivity of Intersections of Regular Halfspaces 

We now describe how our invariance principle yields a bound on the average and noise sensitivity 
of intersections of regular halfspaces. We begin by defining the (Boolean) noise sensitivity of a 
Boolean function: 

Definition 5.1 (noise sensitivity). Let f be a Boolean function f : {1, — l}'^ — > {1, —1}. For any 

6 G (0,1), let X be a random element of the hypercube {1,-1}" and Z a 6 -perturbation of X 
defined as follows: for each i independently, Zi is set to Xi with probability 1 — 5 and —Xi with 
probability 5. The noise sensitivity of f , denoted NS5(/), for noise 6 is then defined as follows: 
N§,(/)=Pr [/(X)//(Z)]. 

Let /!,...,/'= : {1,-1}" ^ {1,-1} be halfspaces with fP{x) = 5\gn{{WP,x) - Op) and let 
f^^ : {1, -1}" ^ {1, -1} be their intersection, /^'^ = /i A A . . . A f''. 

Theorem 5.2. For /^^' e-regular, NS5(/^^) < C(logi-6(fc/(5)) (e^e +^i/2)_ 

We prove the theorem by first reducing bounding noise sensitivity of f^^ to bounding the 
Boolean volume of /oo-neighborhoods of polytopes. We then use our invariance principle, Theo- 
rem 3.1, to prove the required bounds on the Boolean volume of boundaries of polytopes. 

As mentioned before, the above theorem implies a n^°^°' ' ^ algorithm for learning intersections 
of regular halfspaces in the agnostic model for any constant error rate. 

We use the following tail bound that follows from Pinelis's subgaussian tail estimates [Pin94]. 

Fact 5.3. There exist absolute constants ci,C2 > such that all w G M'", t > 0, 

Pr [ > t\\w\\ ] < ci exp(-C2t^). 

The following claim says that for W e-regular, random x G„ {1, —1}", and a (^-perturbation y 
of X, W^x is close to W^y in l^o distance. 

Claim 5.4. For x G {1,-1}", let y{x) be a random 6 -perturbation of y{x) of x. Then, 

Pr [\\W^x-W^y{x)\\^>\]<25, 

x(i^{l,-l}-^ ,y{x) 

where A = C71og(fc/(5)V2<^i/2 + C\og{k/5f/^e^/'^ . 

Proof. Let Y = {Yi, . . . ,Yn) be i.i.d indicator variables with Pr\Yi = 1] = <5. Let S{Y) = 
support{Y). Now, for p E [k], ||VF|(y)f = Er=i ^p^i and E[||W^|(^)f] = 5. Further, since 
W is e-regular, by Hoeffding's inequality, for all t > 0, 



Pr 



\Wl,y,f-S\ > 7 



S{Y)\ 

Thus, by a union bound 

Pr [3pG [k], ||l^|(y)f >6 + 2^/\og{k/6)ej < S. (5.1) 
Note that for a fixed Y and sufficiently large C, by Fact 5.3 and a union bound, 

< 6. 



Pr 

a:Gn{l -1}" 



3PG W, \{Wl^y^,Xs^Y))\>CV^og{J^\\WP^y^\ 
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From Equation 5.1 and the above equation, we get that for a sufficiently large constant C 



Pr 



3pG [A:], \{Wl^yyXs(Y))\>C\og{k/5f/H^'^ + C\og{k/5fl'E^'^] < 25. (5.2 



Now, observe that that for x G {1,-1}", to generate a J-perturbation of x, y{x), we can first 
generate a random Y as above and flip the bits of x in the support of Y . Thus, from Equation 5.2, 

Pr [3p^[kl\{WP,x)-{W^,y{x))\>\]= Pr [3p e [A;], 2 , X5(y))| > A 

< 25, 

where A = Clog(A:/5)i/25V2 + C\og{k/5fl^e^l'^ . Therefore, 

Pr [\\W^x-W^y{x)\\oo>\]<25. 

□ 

The following claim can be seen as an anti-concentration bound for regular polytopes over the 
hypercube and could be of use elsewhere. 

Claim 5.5. For e-regular W G M"''^, 6 G M'^, and < A < 1, 

Pr [VF^xG Rect(0 + Alfc)\Rect(e-Alfc)] < C(logi-6 A:) (e log(l/e))i/^ + V^bilA. 

xGu{l,-l}" 

Proof. Follows directly from Theorem 3.1 and Lemma 3.4. □ 

We can now prove Theorem 5.2. 

Proof of Theorem 5.2. Note that for x,y G M", f^^{x) / f^^'iy) implies that W'^x G Rect(6' + 
7lfc) \ Rect(0 — 7lfc), where 7 = — VK"^y||oo- Hence, 

xe„{i,-i}",y 

< Pr [f^''(x) ^ f^''(y(x))\\\W^x-W^y(x)\\oo< X] + 26 (Claim 5.4) 

a;e„{l,-l}",y 

< Pr [T^^xG Rect(0 + Alfc)\Rect(0-Alfe)]+25 



< C(log^-*^ k) (e log(l/e))^/^ + y/hikX + 25. (Claim 5.5) 

The theorem now follows. □ 

6 Pseudorandom Generators for Polytopes 

We now prove our main theorems for constructing pseudorandom generators for polytopes with 
respect to a variety of distributions (Theorems 1.5, 1.6, and 1.7). 

The results in this section are based on a recent PRC construction due to Meka and Zuckerman 
[MZ09] for polynomial threshold functions using the invariance principle of Mossel et al. [MOO05]. 
A closer look at their construction reveals a general program for constructing PRCs from invariance 
principles. Given this observation, it is natural to ask if our invariance principle can be used 
to construct PRCs for regular polytopes. Indeed it can, and we use the Meka and Zuckerman 
generator but with a different setting of its parameters. The analysis, however, is a little more 
complicated in our setting (even given our invariance principle) and requires a careful application 
of hypercontractivity. 
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6.1 Main Generator Construction 



We begin by describing the construction of the PRG we use; it is a shghtly modified version of the 
PRG used by Meka and Zuckerman [MZ09] to fool regular halfspaces (i.e., the case k = 1). 

Give 5 e (0, 1), let e = n{6^/log^-^ k) be such that log^'^ k{e\og{l/e)Yl^ = 5. Let t = l/e and 
lei 7i = {h : h : [n] — > \t\} be a (2 log /c)-wise independent family of hash functions. That is, for all 
/ ^ N,|/| < 21ogfc and h G [t]^, 

^pyv,<,/.M.) = 6<l = i. 

Efficient constructions of hash families Ti as above with 17^1 = 0(n^^°§^) are known. To avoid 
some technical issues that can be overcome easily, we assume that every hash function h & 7i is 
equi-distributed in the following sense: for all j G [t], \{i : h{i) = j}\ = n/t. 

Let m = n/t and let Go : {0, 1}'^ {1, —1}™" generate a (41og k)-wise independent distribution 
over {1,-1}™. That is, for all / C [n],\I\ < 21ogA; and b G {1,-1}^, 

Pr [Vi el, Xi = hi] = -J^. 

Efficient constructions of generators Gq as above with s = 0{logk logn) are known [NN93]. 

Given a hash family and generator Go as above, we consider the following generator. Define 
G-.Hx ({0,1}")* ^ {1,-1}" by G{h,z^,...,z^) = x, where xih-i{i) = Go{z') for i G [t]. 

6.2 Pseudorandom Generators for Regular Polytopes 

We now argue that the generator G defined in the last section fools regular polytopes and prove 
Theorem 1.5. 

Proof of Theorem 1.5. The bound on the seed length of the generator G follows from the construc- 
tion. The following statement follows from an argument similar to that of the proof of Theorem 3.2: 
for any smooth function -0 : M'^ — > M and e-regular W , 



E mw^Giy))] - E [MW^Y)] 



< Glog3A:(elog(l/e)) U'^^'^Wi. (6.1 



Indeed, to observe that Lemma 4.1 holds for any (2 log A;)-wise independent family of hash functions 
and the proof of Lemma 4.2 relies only on two key properties of X <— /i": (1) For a fixed hash 
function h, the blocks ^/i-i(2)) • • • i ^/i-i(t) independent of one another. (2) For a 

fixed hash function h, and j G [t], the distribution of each block X/^-iq) satisfies (2, 2 log A;)- 
hypercontractivity for all j G [t\. In other words, we used the property that for all j G [t], 

E[|(n,X^-i(j))|4i°g'=] < (GlogA;)2i°g'= ||nf (6.2) 

Note that X generated according to the generator G satisfies both the above conditions: 1) For a 
fixed function h, the blocks are independent by definition and 2) the hypercontractivity inequality 
6.2 only involves the first (4 log A;)-moments of the distribution of X^-i^j). As a consequence, 
inequality 6.2 holds for any (4 log A;)-wise independent distribution over {1, —1}!'* 

We can now move from closeness in expectation to closeness in cdf distance by an argument 
similar to the proof of Theorem 3.1, where we use Equation 6.1 instead of Theorem 3.2, to get 

I Pr [G(y) G /C] - Pr [y G /ClI < <5. 
The theorem now follows from the above equation and Theorem 3.1. □ 
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6.2.1 Approximate Counting for Integer Programs 



The PRG from Theorem 1.5 coupled with enumeration over all possible seeds immediately implies a 
quasi-polynomial time, deterministic algorithm for approximately counting, within a small additive 
error, the number of solutions to "regular" {0, l}-integer programs. It turns out that "regular" 
integer programs correspond to a broad class of well-studied combinatorial problems. For example, 
we obtain deterministic, approximate counting algorithms for dense set cover problems and {0, 1}- 
contingency tables. We obtain quasi-polynomial time algorithms even when there are a polynomial 
number of constraints (or polynomial number of rows in the contingency table setting). As far as 
we know, there is no prior work giving nontrivial deterministic algorithms for counting solutions to 
integer programs with many constraints. 

Here we discuss the case of dense set cover instances and remark that we get similar results for 
the special case of counting contingency tables. Covering integer programs are a fundamental class 
of integer programs and can be formulated as follows. 



where the coefficients of the constraints aij and Cj are all non-negative. An important special 
class of covering integer programs is set cover, which in turn is a generalization of many important 
problems in combinatorial optimization such as edge cover and multidimensional {0, l}-knapsack. 

In the standard set cover problem, the input is a family of sets Si , . . . , 5^ over a universe U of 
size k and an integer t. The goal is to find a subfamily of sets C such that \C\ < t and the union 
of all the sets in C equals U. This corresponds to a covering program as above with k constraints 
and n unknowns from {0, 1}. Call an instance of set cover e-dense if each element in U appears in 
at least 1/e^ of the different sets Si. It is easy to verify that with this restriction, after translating 
from {0, 1} to {1, —1} and appropriate normalization, all the linear constraints in the corresponding 
integer program as in Equation 6.3 are e-regular. Thus, using the generator from Theorem 1.5 and 
enumerating over all seeds to the generator, we have the following: 

Theorem 6.1. There exists a deterministic algorithm that, given instance of an e-dense set cov- 
ering problem with k constraints over a universe of size n, approximates the number of solutions to 
within an additive factor of 5 in time n^°^^^^°?'^^^/^) as long as e < 6^/{\og^'^ k)(\og{l/6)). 

We now briefly elaborate on approximately counting the number of {0, 1} contingency tables. 
The problem of counting {0, l}-contingency tables is the following. Given, positive integers n, k 
n > k, r = {ri, . . . ,r„) S Z", c = (ci, . . . ,Cfc) G we wish to count the number of solutions, 
CT(r,c), to the following integer program whose solutions are matrices X S {0,1}"^'^ with row 
and column sums given by r, c. 



mm 





(6.3) 



AE {0,1} 



n 



Find X G {0, 1} 



nxk 




ij — Cj) 1 < J < fc. 
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Observe that, after translating from {0, 1} to {1, —1} and appropriately normalizing, solutions 
to the above integer program correspond to points from {1,— 1}""^*^ that lie in an intersection of 
2(n + A;)-halfspaces each of which is -regular (recall that the notion of regularity does not 

depend on the value of the Cj's or r^'s). Thus, as with dense instances of set cover, we can use 
Theorem 1.5 to count the number of {0, l}-contingency tables: 

Theorem 6.2. There exists a deterministic algorithm that on input r G Z", c G Z'^, approximates 
CT(r, c)/2"'^, the fraction of {0,1} -contingency tables with sums r,c, to within additive error 6, 
and runs in time 77,poiy{iogfc.i/5) . 

We remark that using results of Wolff [Wol07], who shows hypercontractivity for various discrete 
distributions, we can approximately count number of solutions to dense set cover instances and 
contingency tables over most natural domains. 



6.3 Pseudorandom Generators for Poly topes in Gaussian Space 

We now prove Theorem 1.6. We use an idea of Ailon and Chazelle [AC06] and the invariance of 
the Gaussian measure to unitary rotations to obtain PRGs with respect to AA" for all polytopes. 
Similar ideas were used by Meka and Zuckerman to obtain PRGs for spherical caps (i.e., the case of 
one hyperplane). In our setting, we must prove that, with respect to a random rotation, all of the 
bounding hyperplanes become regular with high probability. Such a tail bound requires applying 
hypercontractivity. 

Let H G M"'^"' be the normalized Hadamard matrix with Hij G {l/y/n,—l/y/ri}. Ailon and 
Chazelle show that for any w G M", and a random diagonal matrix D with uniformly random {1,-1} 
entries, the vector HDw is regular with high probability. We derandomize their observation using 
hypercontractivity. For a vector x G M", let D{x) G M'"^" be the diagonal matrix with diagonal 
entries x. 

Lemma 6.3. There exists a constant C > such that the following holds. For any w G M", 

= 1 , < S < 1 and any [C\og{k/6))-wise independent distribution T) over {1, —1}", 

Vx^{\\liD{x)w\\\ > Clog^{k/6)/n] < 6/k. 

Proof. Fix awG and a C \og{k / 6)-wise independent distribution T> for constant C to be chosen 
later. Let random variable Z = \\HD{x)w\\f = Huxiwi)"^ for x <— D. Note that x satisfies 

(2, g)-hypercontractivity for q < Clog{k/6). Now, 



< 



ixiwi 



H'Xi'Wi 



E 



ixiwi 



•E 



E^. 



iXlWl 



< J^S^ E 



c4 



E^' 



El c 



E 



jixm 



Cauchy-Schwarz inequality 



(2, 8)-hypercontractivity 



17 



Observe that Z is a degree 4 multilinear polynomial over xi, . . . , Xn- Therefore, by (2, g)-hypercontractivity, 
Lemma 2.4, applied to the random variable Z, for q < C\og{k/S)/4:, 

^9/2 „2q 

E[|Z|«] < < J . 

Hence, by Markov's inequality, for 7 > 0, 



Pr[|Z| > 7] = Pr[|Z|'' > 7*?] < 



'^1/2 g2y 



7n / 

The lemma now follows by taking q = 2\og{k/5) and 7 = 2c^/^ g^/n. □ 

Let G : {0,1}^ {1,-1}" be the generator from Theorem 1.5 for r = 0((lognlog A;)/e). 
Let Gi : {0,1}''^ {1,-1}" generate a Clog -wise independent distribution, for constant 
C as in Lemma 6.3. Generators Gi as above with ri = 0(log(A;/(5) log n) are known. Define 
Gm ■■ {0, ly^ X {0, l}*" ^ as follows: 

G^ix,y) = D{Giix))HGiy). 

We claim that Gj\f (5-fools all polytopes with respect to M'^. 

Proof of Theorem 1.6. Recall that e = ^l[5^^^ / \o^'^ k) > . The seed length of Gj\^ is ri + r = 

0(lognlogA;/e). Fix W G M"^". Observe that W^G^ix^y) = {HD{Gi{x))WfG{y). Now, from 
Lemma 6.3 and a union bound it follows that 

Pr [HD{Giix))W is not e-regular] < 6. (6.4) 

a;eu{0,l}''i 

Further, from the invariance of AA" with respect to unitary rotations, for any x G {0, 1}^^ , 
Pr [{HD{Gi{x))Wfz £ Rect{9)] = Pr [W^z e Rect(0)l. 

Thus, from Theorem 1.5 applied to M, we get that for HD{Gi{x))W e-regular, 

I Pr ^^^[{HD{Gi{x))WfG{y) G Rect(e)] - Pj^jW^z G Rect{9)]\ < S. (6.5) 

The theorem now follows from Equations (6.4), (6.5). □ 

6.4 Pseudorandom Generators for Intersections of Spherical Caps 

Theorem 1.7 follows from Theorem 1.6 and the following new invariance principle for polytopes 
over S^~^: 

Lemma 6.4. For any polytope fC with k faces, 

C log n log k 



Pr [X en- Pr \Yly/^ £ IC] 



< 



n 



The proof uses Nazarov's bound on Gaussian surface area and the following classical large 
deviation bound for the norm of a random Gaussian vector (for a nice exposition of the bound see 
[Tao09]) 



18 



Lemma 6.5. For Y ^ A/"", 

Pr[|||y|| - > t] < aexp(-6t2)^ 
where a,b > are universal constants. 

Proof of Lemma 6.4. Fix a polytope JC{W,9). Let X E„ 5"-^ and Y ^ Af^. Note that Y/\\Y\\ is 
uniformly distributed over S'^~^. Fix 5 = cjvi}!'^ for a constant c to be chosen later. Observe that 
for Y <— AA", and u G M", ||u|| = 1, {u^Y) is distributed as M . Hence, for any u G M"", \\u\\ = 1, 

Pr[Kn,y)|>^log(fc/<5)]<|. 

Therefore, by a union bound, 

Pr[ \\W^Y\\^/V^ > y/\og{k/6)/V^] < 5. 
Further, by using Lemma 6.5 and the fact that y/||y|| is uniformly distributed over S"*^"^, 

Pr[\\W'^X\\oc > ^/Clog{k/6)/V^] < 26, 

for a sufficiently large constant C. From the above two equations, it follows that to prove the 
theorem we can assume that 

ll^lloo < y'Clog{k/6)/n. 
Now, from Lemma 6.5 and the above equation it follows that 

Pr[|||y|| - ^\ II^IU > VClog(l/5)log(A:/(^)/n] < 6. (6.6) 

Let A = ^Clog(l/(5) log{k/6)/n. Then, since G„ 5"-^ 

I Pr[X G /C] - Pr[Y/^/^ ^m = \ Pr[W^X G Rect(^)] - Pr[W^Y/^/^ G Rect(^)] | 

= I Pr[W'^Y G ||y|| Rect(0)] - Pr[W^Y G ,/^Rect{e)] \ 

< Pr[|||y|| - V^l ll^lloo > A] 

+ Pr[ W^Y G Rect(y^0 + Alfc) \ Rect( - XU) ] 

< 5 + 0( Aylogfc ). (Equation 6.6, Lemma 3.4) 

The lemma now follows by choosing 5 = cjn}!'^ for a sufficiently large constant c. □ 

Proof of Theorem 1.7. Define Gsp : {0,1}''' x {0,1}'^ ^ 5""^ by G,p(x,y) = Gx{x,y)/.Jli. It 
follows from Theorem 1.6 and Lemma 6.4 that Gsp fools polytopes over S*""^ as in the theorem. □ 
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